Synchronous input/output diagnostic controls

ABSTRACT

Aspects include communicating synchronous input/output (I/O) commands between a system and a recipient. The system executes on a processor coupled to a memory so as to issue, to the recipient, a synchronous I/O command with a diagnostic operation code, where the synchronous I/O command causes a gathering of diagnostic information by the recipient. Further, the system receives, from the recipient, status information including a diagnostic information identifier, the status information being in accordance with the diagnostic information.

BACKGROUND

The disclosure relates generally to input/output (I/O) on a computer,and more specifically, to synchronous I/O diagnostic controls.

In general, the technical field discussed herein includes communicationsbetween servers and storage control units over a storage area networkinvolving multiple switches and multiple layers of a protocol stack.Contemporary implementations of these communications between servers andstorage control units include asynchronous access operations byoperating systems within the storage area network. Asynchronous accessoperations require queues and schedulers for initiating the requests,along with interruptions for any associated context switch forprocessing a completion status. These queues, schedulers, andinterruptions amount to asynchronous overhead that adds significantlatency and processing delays across the storage area network.

Storage Area Networks (SANs), as described by the Storage NetworkingIndustry Association (SNIA), are high performance networks that enablestorage devices and computer systems to communicate with each other. Inlarge enterprises, multiple computer systems or servers have access tomultiple storage control units within the SAN. Typical connectionsbetween the servers and control units use technologies such as Ethernetor Fibre-Channel, with the associated switches, I/O adapters, devicedrivers and multiple layers of a protocol stack. Fibre-channel, forexample, as defined by the INCITS T11 Committee, defines physical andlink layers FC0, FC1, FC2 and FC-4 transport layers such as the FibreChannel Protocol (FCP) for SCSI and FC-SB-3 for Fibre Connectivity(FICON). There are many examples of synchronous and asynchronous I/Oaccess methods, each with their own advantages and disadvantages.Synchronous I/O causes a software thread to be blocked while waiting forthe I/O to complete, but avoids context switches and interrupts. Thisworks well when the I/O is locally attached with minimal access latency,but as access times increase, the non-productive processor overhead ofwaiting for the I/O to complete becomes unacceptable for largemulti-processing servers. The current state of the art for server accessto SAN storage, with its associated protocol over-head, is to useasynchronous I/O access methods. The large variation in access times,and even the minimum access times, of SAN storage with today's protocolssuch as Fibre-Channel, make synchronous I/O access unacceptable.

SUMMARY

In view of the above, embodiments of the present invention disclosedherein may include a synchronous system, method, and/or computer programproduct (herein synchronous system) that executes diagnostic mechanismsfor providing first failure data capture (FFDC) and diagnostic controlsfor input/output (I/O).

For instance, synchronous I/O eliminates many of the layers of storagearea network of contemporary implementations, thus improving efficiencywhile still providing the traditional enterprise qualities of dataintegrity and continuous availability with predictable and repeatablehigh performance. The diagnostic mechanisms herein provide FFDC togather enough diagnostic information from both ends of the synchronousI/O link to be able to diagnose problems at the first occurrence and nothave to ask clients to recreate the errors and add complicated traceequipment that can slow down the production work. The diagnosticmechanisms herein are available for use to both firmware and softwarelayers so as to provide a complete end-to-end view for a comprehensiveview of any abnormal condition.

Embodiments include a method for communicating synchronous input/output(I/O) commands between a system and a recipient. The method includesissuing, by the system to the recipient, a synchronous I/O command witha diagnostic operation code, where the synchronous I/O command causes agathering of diagnostic information by the recipient. The method furtherincludes receiving, by the system from the recipient, status informationincluding a diagnostic information identifier, where the statusinformation is in accordance with the diagnostic information. Thismethod can provide the ability to eliminate many of the layers of astorage area network of contemporary implementations, thus improvingefficiency while still providing the traditional enterprise qualities ofdata integrity and continuous availability with predictable andrepeatable high performance.

Methods can also include a storage control unit as the recipient. Thiscan provide the ability to eliminate a layer of the storage area networkof contemporary implementations.

Methods can also include identifying an abnormal condition. This canprovide the ability to provide a complete comprehensive end-to-end viewof the system for any abnormal condition.

Methods can also include a firmware as the system that identifies theabnormal condition and issues the synchronous I/O command independent ofan operating system in response to the identifying the abnormalcondition. This can provide the ability to eliminate a layer of thestorage area network of contemporary implementations.

Methods can also include an operating system as the system thatidentifies an abnormal condition. This can provide the ability toprovide a complete comprehensive end-to-end view of the system for anyabnormal condition while eliminating a layer of the storage area networkof contemporary implementations.

Methods can also include identifying an I/O error, a protocol violation,an invalid response, an unrecognized command, or time-out condition asthe abnormal condition. This can provide the ability to provide acomplete comprehensive end-to-end view of the system for any abnormalcondition.

Methods can also include issuing the synchronous I/O command to notifythe recipient of an abnormal condition. This can provide the ability toprovide a complete comprehensive end-to-end view of the system for anyabnormal condition while eliminating a layer of the storage area networkof contemporary implementations.

Methods can also include gathering of the diagnostic information bypolling a status area by the recipient. This can provide the ability toeliminate a layer of the storage area network of contemporaryimplementations.

Methods can also include correlating the diagnostic information betweenthe system and the recipient in accordance with the diagnosticinformation identifier. This can provide the ability to eliminate alayer of the storage area network of contemporary implementations.

Additional embodiments include systems and computer program products toimplement the above methods.

Additional features and advantages are realized through the techniquesof the present invention. Other embodiments and aspects of the inventionare described in detail herein and are considered a part of the claimedinvention. For a better understanding of the invention with theadvantages and the features, refer to the description and to thedrawings.

BRIEF DESCRIPTION OF THE DRAWINGS

The subject matter which is regarded as the invention is particularlypointed out and distinctly claimed in the claims at the conclusion ofthe specification. The forgoing and other features, and advantages ofthe invention are apparent from the following detailed description takenin conjunction with the accompanying drawings in which:

FIG. 1 illustrates a communication schematic comparing synchronousinput/output (I/O) and traditional I/O in accordance with an embodiment;

FIG. 2 illustrates a block diagram of a system for performingsynchronous I/O in accordance with an embodiment;

FIG. 3 illustrates a block diagram of an environment including asynchronous I/O link interface in accordance with an embodiment;

FIG. 4 illustrates a block diagram of an environment for performingsynchronous I/O with respect to a mailbox command and read operation inaccordance with an embodiment;

FIG. 5 illustrates a block diagram of an environment for performingsynchronous I/O with respect to a write operation in accordance with anembodiment;

FIG. 6 illustrates a block diagram of an environment performingsynchronous I/O with respect to a status operation in accordance with anembodiment;

FIG. 7 illustrates a process flow of a synchronous system with respectto diagnostic controls of synchronous I/O commands in accordance with anembodiment;

FIG. 8 illustrates another process flow of a synchronous system withrespect to diagnostic controls of synchronous I/O commands in accordancewith an embodiment; and

FIG. 9 illustrates a processing system in accordance with an embodimentof the present invention.

DETAILED DESCRIPTION

In view of the above, embodiments of the present invention disclosedherein may include a synchronous system, method, and/or computer programproduct (herein synchronous system) that executes diagnostic mechanismsfor providing first failure data capture (FFDC) and diagnostic controlsfor input/output (I/O).

For instance, synchronous I/O eliminates many of the layers of storagearea network of contemporary implementations, thus improving efficiencywhile still providing the traditional enterprise qualities of dataintegrity and continuous availability with predictable and repeatablehigh performance. The diagnostic mechanisms herein provide FFDC togather enough diagnostic information from both ends of the synchronousI/O link to be able to diagnose problems at the first occurrence and nothave to ask clients to recreate the errors and add complicated traceequipment that can slow down the production work. The diagnosticmechanisms herein are available for use to both firmware and softwarelayers so as to provide a complete end-to-end view for a comprehensiveview of any abnormal condition.

Embodiments described herein include a new synchronous input/output(I/O) central processing unit (CPU) instruction or command that allowsboth the synchronous execution of read and write I/O operations, or theinitiation of an I/O operation and subsequent synchronous test forcompletion. In embodiments, a synchronous low latency protocol runningover an interface link such as peripheral component interconnect express(PCIe) is utilized to communicate directly between a processor (alsoreferred to herein as a server) and a storage subsystem. The storagesubsystem receives mailbox commands, delivered from the CPU over thePCIe link, that request the synchronous execution of read/writecommands. For a read command, if the data is not already in the controlunit cache the synchronous command ends unsuccessfully. The control unitcan initiate processing to asynchronously read the data into the controlunit cache so that it can quickly be read via traditional I/Oprocessing. If the data was in the cache, it can be transferred to thehost memory and the synchronous I/O CPU instruction completessuccessfully. Write operations can transfer data from host memory to thecontrol unit cache of one or more nodes within the control unit.

Embodiments of synchronous I/O described herein when compared totraditional I/O can be utilized to eliminate the overhead associatedwith a dispatcher, improve transactional latency, preserve contents ofL1 and L2 cache by eliminating a context switch, and can reduce lockcontention on data by reducing latency.

Described herein is a new dynamic I/O paradigm for storage that canperform both synchronous and asynchronous (or traditional) processingfrom the application/middleware point of view. When applied tomiddleware, such as DB2® from IBM®, this new paradigm can result infaster performance. Current versions of DB2 can only have one I/Ooperation writing to the log at any one time. This single threadedprocess is highly dependent on the latency of these log write commandsfor the total throughput that can be accommodated for processingtransactions. Embodiments of the synchronous I/O paradigm describedherein can reduce the amount of time that it takes to write to the log.Note, that multi-write technology (e.g., zHyperWrite® technologyproduced by IBM) provides a way of eliminating the latency required bysynchronous replication of data. The combination of embodimentsdescribed herein with the use of multi-write technology can provide theability to maintain continuous availability with multi-switch technology(e.g., HyperSwap technology produced by IBM) while getting the benefitsof synchronous I/O.

The workloads that run on the z/OS® (an OS from IBM) can typically seevery high cache hit ratios (e.g., 90 percent) for read operations. Thismeans that a high percent of the time the data is found in a dynamicrandom access memory (DRAM) cache in the storage subsystem (e.g., apersistent storage control unit), resulting in reduced I/O servicetimes. These high cache hit ratios can be the result of a long traditionof close integration of the I/O stack from the applications ability topass cache hints on I/O operations through optimized caching algorithmsin the storage subsystem. When the data required for a read request isin DRAM in the storage subsystem it is amenable to being transferred tothe host with the new synchronous I/O paradigm. If a cache miss occurs,the storage subsystem can initiate the process to bring the data intocache but synchronously notify the host to execute the I/O operationusing the traditional asynchronous model.

Utilizing embodiments described herein to avoid the un-dispatching andre-dispatching of critical applications and middleware, can lead to areduction in OS overhead and eliminate the L1 and L2 cache damage thatcan occur when a context switch occurs for a different application.Reducing the L1 and L2 cache damage and re-dispatching of work can leadto a significant reduction in CPU cost.

When embodiments are applied to DB2 executing on a z/OS platform,utilizing embodiments to accelerate read I/O and data base logging canreduce DB2 transactional latency and accelerate transaction processingon the z/OS platform.

In addition, improving database log throughput can reduce cost byrequiring fewer data sharing instances (LPARs, I/O connections, logdevices) to achieve the work load requirements. It can also avoidforcing clients to re-engineer the workloads in order to avoid hittingconstraints.

Turning now to FIG. 1, communication schematics 100 of a traditional I/Oand a synchronous I/O when updating data stored on a peripheral storagedevice are generally shown in accordance with embodiments. As shown onthe right side of FIG. 1, performing traditional I/O operations includesreceiving a unit of work request 124 at an operating system (OS) 122 ina logical partition (LPAR). The unit of work can be submitted, forexample, from an application or middleware that is requesting an I/Ooperation. As used herein the term “unit of work” refers to dispatchabletasks or threads.

In response to receiving the unit of work request, the OS 122 performsthe processing shown in block 104 to request a data record. Thisprocessing includes scheduling an I/O request by placing the I/O requeston a queue for the persistent storage control unit (CU) 102 thatcontains the requested data record 104, and then un-dispatching the unitof work. Alternatively, the application (or middleware) can receivecontrol back after the I/O request is scheduled to possibly performother processing, but eventually the application (or middleware)relinquishes control of the processor to allow other units of work to bedispatched and the application (or middleware) waits for the I/O tocomplete and to be notified when the data transfer has completed with orwithout errors.

When the persistent storage control unit (SCU) 102 that contains thedata record 104 is available for use and conditions permit, the I/Orequest is started by the OS issuing a start sub-channel instruction orother instruction appropriate for the I/O architecture. The channelsubsystem validates the I/O request, places the request on a queue,selects a channel (link) to the persistent SCU 102, and when conditionspermit begins execution. The I/O request is sent to a persistent SCU102, and the persistent SCU 102 reads the requested data record from astorage device(s) of the persistent SCU 102. The read data record alongwith a completion status message is sent from the persistent SCU 102 tothe OS 122. Once the completion status message (e.g., via an I/Ointerrupt message) is received by the OS 122, the OS 122 requests thatthe unit of work be re-dispatched by adding the unit of work to thedispatch queue. This includes re-dispatching the LPAR to process theinterrupt and retrieving, by the I/O supervisor in the OS, the statusand scheduling the application (or middleware) to resume processing.When the unit of work reaches the top of the dispatch queue, the unit ofwork is re-dispatched.

Still referring to the traditional I/O, once the data record is receivedby the OS 122, the OS 122 performs the processing in block 106 to updatethe data record that was received from the persistent SCU 102. At block108, the updated data record is written to the persistent SCU 102. Asshown in FIG. 1, this includes the OS 122 scheduling an I/O request andthen un-dispatching the instruction. The I/O request is sent to apersistent SCU 102, and the persistent SCU 102 writes the data record toa storage device(s) of the persistent SCU 102. A completion statusmessage (e.g., an interruption message) is sent from the persistent SCU102 to the OS 122. Once the completion status message is received by theOS 122, the OS 122 requests that the unit of work be re-dispatched byadding the unit of work to the dispatch queue. When the unit of workreaches the top of the dispatch queue, the unit of work isre-dispatched. At this point, the unit of work is complete. As shown inFIG. 1, the OS 122 can perform other tasks, or multi-task, while waitingfor the I/O request to be serviced by the persistent SCU 102.

The traditional I/O process is contrasted with a synchronous I/Oprocess. As shown in FIG. 1, performing a synchronous I/O includesreceiving a unit of work request at the OS 122. In response to receivingthe unit of work request, the OS 122 performs the processing shown inblock 114 which includes synchronously requesting a data record from thepersistent SCU 112 and waiting until the requested data record isreceived from the persistent SCU 112. Once the data record is receivedby the OS 122, the OS 122 performs the processing in block 116 to updatethe data record. At block 118, the updated data record is synchronouslywritten to the persistent SCU 112. A synchronous status message is sentfrom the persistent SCU 112 to the OS 122 to indicate the data has beensuccessfully written. At this point, the unit of work is complete. Asshown in FIG. 1, the OS 122 is waiting for the I/O request to beserviced by the persistent SCU 112 and is not performing other tasks, ormulti-tasking, while waiting for the I/O request to be serviced. Thus,in an embodiment, the unit of work remains active (i.e., it is notun-dispatched and re-dispatched) until the OS is notified that the I/Orequest is completed (e.g., data has been read from persistent SCU, datahas been written to persistent SCU, error condition has been detected,etc.).

Thus, as shown in FIG. 1, synchronous I/O provides an interface betweena server and a persistent SCU that has sufficiently low overhead toallow an OS to synchronously read or write one or more data records. Inaddition to the low overhead protocol of the link, an OS executing onthe server can avoid the scheduling and interruption overhead by using asynchronous command to read or write one or more data records. Thus,embodiments of synchronous I/O as described herein when compared totraditional I/O not only reduce the wait time for receiving data from apersistent SCU, they also eliminate steps taken by a server to servicethe I/O request. Steps that are eliminated can include theun-dispatching and re-dispatching of a unit of work both when a requestto read data is sent to the persistent SCU and when a request to writedata is sent to the persistent SCU. This also provides benefits inavoiding pollution of the processor cache that would be caused byun-dispatching and re-dispatching of work.

As used herein, the term “persistent storage control unit” or“persistent SCU” refers to a storage area network (SAN) attached storagesubsystem with a media that will store data that can be accessed after apower failure. As known in the art, persistent SCUs are utilized toprovide secure data storage even in the event of a system failure.Persistent SCUs can also provide backup and replication to avoid dataloss. A single persistent SCU is typically attached to a SAN andaccessible by multiple processors.

As used herein, the term “synchronous I/O” refers to a CPU synchronouscommand that is used to read or write one or more data records, suchthat when the command completes successfully, the one or more datarecords are guaranteed to have been transferred to or from thepersistent storage control unit into host processor memory.

Turning now to FIG. 2, a block diagram of a system 200 (e.g.,synchronous system) for performing synchronous I/O is generally shown inaccordance with an embodiment. The system 200 shown in FIG. 2 includesone or more application/middleware 210, one or more physical processors220, and one or more persistent SCUs 230. The application/middleware 210can include any application software that requires access to datalocated on the persistent SCU 230 such as, but not limited to arelational database manager 212 (e.g. DB2), an OS 214, a filesystem(e.g., z/OS Distributed File Service System z File System produced byIBM), a hierarchical database manager (e.g. IMS® produced by IBM), or anaccess method used by applications (e.g. virtual storage access method,queued sequential access method, basic sequential access method). Asshown in FIG. 2, the database manager 212 can communicate with an OS 214to communicate a unit of work request that requires access to thepersistent SCU 230. The OS 214 receives the unit of work request andcommunicates with firmware 224 located on the processor 220 to request adata record from the persistent SCU 230, to receive the data record fromthe persistent SCU 230, to update the received data record, to requestthe persistent SCU 230 to write the updated data record, and to receivea confirmation that the updated data recorded was successfully writtento the persistent SCU 230. The firmware 224 accepts the synchronousrequests from the OS 214 and processes them. Firmware 232 located on thepersistent SCU 230 communicates with the firmware 224 located on theprocessor 220 to service the requests from the processor 220 in asynchronous manner.

As used herein, the term “firmware” refers to privileged code running onthe processor that interfaces with the hardware used for the I/Ocommunications; a hypervisor; and/or other OS software.

Embodiments described herein utilize peripheral component interconnectexpress (PCIe) as an example of a low latency I/O interface that may beimplemented by embodiments. Other low latency I/O interfaces, such as,but not limited to Infiniband™ as defined by the InfiniBand TradeAssociation and zSystems coupling links can also be implemented byembodiments.

Turning now to FIG. 3, a block diagram of an environment 300 including asynchronous I/O link interface 305 is depicted in accordance with anembodiment. As shown in FIG. 3, the environment 300 utilizes thesynchronous I/O link interface 305 as an interface between a server(e.g., a system 310) and a persistent SCU (e.g., a persistent SCU 320).The synchronous I/O link interface 305 has sufficiently low latency andprotocol overhead to allow an OS of the system 310 to synchronously reador write one or more data records from the persistent SCU 320. Inaddition to the low protocol overhead of the link, the OS can avoid theoverhead associated with scheduling and interrupts by using asynchronous command via the synchronous I/O link interface 305 to reador write one or more data records. The synchronous I/O link interface305, for example, can be provided as an optical interface based on anyPCIe base specification (as defined by the PCI-SIG) using thetransaction, data link, and physical layers. The synchronous I/O linkinterface 305 may further include replay buffers and acknowledgementcredits to sustain full bandwidth.

The system 310 is configured to provide at least one synchronous I/Olink interface 305 having at least one synchronous I/O link 315 to allowconnection to at least one persistent SCU (e.g., persistent SCU 320). Itcan be appreciated that two or more synchronous I/O links 315 may berequired for each connection to a persistent SCU. It can also beappreciated that two or more synchronous I/O links 315 may supportswitch connections to a persistent SCU. In an exemplary embodiment,where PCIe is utilized, the system 310 comprises a PCIe root complex 330for the interface link 315, while the persistent SCU 320 comprises aPCIe endpoint 335 for the control unit synchronous I/O interface 305.

Turning now to FIG. 4, a block diagram of an environment 400 forperforming synchronous I/O with respect to a mailbox command and readoperation is depicted in accordance with an embodiment. As shown in FIG.4, the environment 400 includes a system 310 (e.g., includes theapplication/middleware 210 and processor 200 of FIG. 2) and a persistentSCU 320 (e.g., includes persistent CU 230 of FIG. 2). The system 310includes a LPAR 411 comprising memory locations for a data record 413and an associated suffix 415 and a status area 421 comprising a devicetable entry (DTE) 423 and a status field 425. DTE 423 is an example of adata structure used by the firmware to store the mappings, such as,between virtual addresses and physical addresses. Similarly, a functiontable entry (FTE) is an example of a data structure used by a functiontable to indicate access to a specified synchronous I/O link. Thepersistent SCU 320 includes at least one mailbox 440 and a data record450.

In operation, synchronous I/O commands issued by the OS of the system310 are processed by the firmware 224 to build a mailbox command 460that is forwarded to the persistent SCU 320. For example, uponprocessing a synchronization I/O command for the OS by a firmware of thesystem 310, the firmware prepares hardware of the system 310 and sendsthe mailbox command 460 to the persistent SCU 320. The mailbox command460 is sent to the persistent SCU 320 in one or more memory writeoperations (e.g., over PCIe, using a PCIe base mailbox address that hasbeen determined during an initialization sequence described below). Aplurality of mailboxes can be supported by the persistent SCU 320 foreach synchronous I/O link 305. A first mailbox location of the pluralityof mailboxes can start at the base mailbox address, with each subsequentmailbox location sequentially located 256-bytes after each other. Afterthe mailbox command 460 is sent, the firmware can poll the status area421 (e.g., a status field 425) for completion or error responses. Inembodiments, the status area 421 is located in privileged memory of thesystem 310 and is not accessible by the OS executing on the system 310.The status area 421 is accessible by the firmware on the system 310 andthe firmware can communicate selected contents (or information relatedto or based on contents) of the status area 421 to the OS (e.g., via acommand response block).

In general, a single mailbox command 460 is issued to each mailbox at atime. A subsequent mailbox command will not issue to a mailbox 440 untila previous mailbox command has completed or an error condition (such asa timeout, when the data is not in cache, error in the command requestparameters, etc.) has been detected. Successive mailbox commands for agiven mailbox 440 can be identified by a monotonically increasingsequence number. Mailboxes can be selected in any random order. Thepersistent SCU 320 polls all mailboxes for each synchronous I/O link 305and can process the commands in one or more mailboxes in any order. Inan embodiment, the persistent SCU 320 polls four mailboxes for eachsynchronous I/O link 305. Receipt of a new mailbox command with anincremented sequence number provides confirmation that the previouscommand has been completed (either successfully or in error by thesystem 310). In an embodiment, the sequence number is also used todetermine an offset of the status area 421. The mailbox command can beof a format that includes 128-bytes. The mailbox command can be extendedby an additional 64-bytes or more in order to transfer additional datarecords. In an embodiment, a bit in the mailbox command is set toindicate the absence or presence of the additional data records.

The mailbox command can further specify the type of data transferoperations, e.g., via an operation code. Data transfer operationsinclude read data and write data operations. A read operation transfersone or more data records from the persistent SCU 320 to a memory of thesystem 310. A write operation transfers one or more data records fromthe memory of the system 310 to the storage persistent SCU 320. Inembodiments, data transfer operations can also include requesting thatthe persistent SCU 320 return its World Wide Node Name (WWNN) to thefirmware in the server. In further embodiments, data transfer operationscan also request that diagnostic information be gathered and stored inthe persistent SCU 320.

In any of the data transfer operations the contents of the mailboxcommand can be protected using a cyclic redundancy check (CRC) (e.g., a32 bit CRC). In an embodiment, the mailbox command can be protected by achecksum. In an embodiment, if the persistent SCU 320 detects a checksumerror, a response code to indicate the checksum error is returned.Continuing with FIG. 4, a synchronous I/O read data record operationwill now be described. For instance, if a mailbox command 460 includesan operation code set to read, the persistent SCU 320 determines if thedata record or records 450 are readily available, such that the datatransfer can be initiated in a sufficiently small time to allow the readto complete synchronously. If the data record or records 450 are notreadily available (or if any errors are detected with this mailboxcommand 460), a completion status is transferred back to the system 310.If the read data records are readily available, the persistent SCU 320provides 462 the data record 450.

In an embodiment, the persistent SCU 320 processes the mailbox command460, fetches the data record 450, provides CRC protection, andtransfers/provides the data record 450 over the synchronous I/O link305. The persistent SCU 320 can provide 462 the data record 450 assequential memory writes over PCIe, using the PCIe addresses provided inthe mailbox command 460. Each data record may require either one or twoPCIe addresses for the transfer as specified in the mailbox command 460.For example, if length fields in the mailbox command indicate the datarecord is to be transferred in a single contiguous PCIe address range,only one starting PCIe address is required for each record, with eachsuccessive PCIe memory write using contiguous PCIe addresses. Inembodiments, the length fields specify the length in bytes of each datarecord to be transferred.

The data record 450 can include a data portion and a suffix storedrespectively on data record 413 and suffix 415 memory locations of thelogical partition 411 after the data record 450 is provided. The datarecord 413 can be count key data (CKD) or extended count key data(ECKD). The data record 413 can also be utilized under small computersystem interface (SCSI) standards, such as SCSI fixed block commands.Regarding the suffix, at the end of each data record 450, an additional4-bytes can be transferred comprising a 32-bit CRC that has beenaccumulated for all the data in the data record 450. The metadata of thesuffix 415 can be created by an operating system file system used formanaging a data efficiently. This can be transferred in the last memorywrite transaction layer packet along with the last bytes of the datarecord 450, or in an additional memory write.

In addition, a host bridge of the system 310 performs addresstranslation and protection checks (e.g., on the PCIe address used forthe transfers) and provides an indication in the DTE 423 to the firmwareof the system 310 when the data read 462 is complete. The host bridgecan also validate that the received CRC matches the value accumulated onthe data transferred. After the last data record and corresponding CRChave been initiated on the synchronous I/O link 305, the persistent SCU320 considers this mailbox command 460 complete and must be ready toaccept a new command in this mailbox 440.

In an exemplary embodiment, the system 310 will consider the mailboxcommand 450 complete when all the data records 450 have been completelyreceived and the corresponding CRC has been successfully validated. Forexample, the firmware performs a check of the status area 421 todetermine if the data read 462 was performed without error (e.g.,determines if the DTE 423 indicates ‘done’ or ‘error’). If the data read462 was performed without error and is complete, the firmware thencompletes the synchronous I/O command. The system 310 will also considerthe mailbox command 450 complete if an error is detected during the dataread 462 or CRC checking process, error status is received from thepersistent SCU 320, or the data read 462 does not complete within thetimeout period for the read operation.

Embodiments of the mailbox command can also include a channel imageidentifier that corresponds to a logical path previously initialized bythe establish-logical-path procedure, for example over a fibre-channelinterface. If the logical path has not been previously established, aresponse code corresponding to this condition can be written to thestatus area 421 to indicate that the logical path was not previouslyestablished.

The mailbox command block can also include a persistent SCU imageidentifier that corresponds to a logical path previously initialized bythe establish-logical-path procedure. If the logical path has not beenpreviously established, a response code corresponding to this conditioncan be written to the status area 421 to indicate that the logical pathwas not previously established.

The mailbox command block can also include a device address within thelogical control unit (e.g., a specific portion of the direct accessstorage device located in the storage control unit) that indicates theaddress of the device to which the mailbox command is directed. Thedevice address should be configured to the persistent SCU specified,otherwise the persistent SCU 320 can return a response code (e.g., tothe status area 421 in the system 310) to indicate this condition.

The mailbox command block can also include a link token that isnegotiated by the channel and the persistent SCU 320 each time thesynchronous I/O link is initialized. If the persistent SCU 320 does notrecognize the link token, it can return a value to the status area 421that indicates this condition.

The mailbox command block can also include a WWNN that indicates theWWNN of the persistent SCU to which the command is addressed. Inembodiments, it is defined to be the 64-bit IEEE registered nameidentifier as specified in the T11 Fibre-Channel Framing and Signaling 4(FC-FS-4) document. If the specified WWNN does not match that of thereceiving persistent SCU, then a response code indicating this conditionis returned to processor.

The mailbox command block can also include device specific informationthat is used to specify parameters specific to this command. Forexample, for enterprise disk attachment when a write or read isspecified by the operation code, device specific information can includethe prefix channel command. In another example, when the operation codespecifies that the command is a diagnostic command, the device specificinformation can include a timestamp representing the time at which thiscommand was initiated and a reason code.

The mailbox command can also include a record count that specifies thenumber of records to be transferred by this synchronous I/O command (ormailbox command).

When PCIe is being utilized with a mailbox command that includesmultiple 32 bit words, the mailbox command can include one or more PCIedata addresses in the following format: PCIe data address bits 63:32 inword “n” to specify the word-aligned address of the location in memory(e.g., in the processor) where data will be fetched for a write andstored for a read operation; and PCIe data addressing bits 31:2 in word“n+1”. In addition word n+1 can include an end or record bit that can beset to indicate that the last word specified is the last word of therecord that is to be read or written.

The mailbox command can also include a mailbox valid bit(s) thatindicates whether the mailbox command is valid and whether the entiremailbox command has been received.

In view of the above, a synchronous I/O write data record operation willnow be described with respect to FIG. 5 in accordance with anembodiment. As shown in FIG. 5, the environment 500 includes a system310 and a persistent SCU 320. The system 310 includes a logicalpartition 511 comprising memory locations for a data record 513 and asuffix 515 and a status area 521 comprising a DTE 523 and a status field525. The persistent SCU 320 includes at least one mailbox 540 and a datarecord 550 once written.

In operation, for example, upon processing a synchronization I/O commandfor the OS by a firmware of the system 310, the firmware prepareshardware of the system 310 and sends the mailbox command 560 to mailbox540 of the persistent SCU 320. As noted above, a plurality of mailboxescan be supported by the persistent SCU 320 for each synchronous I/O link305. Further, after the mailbox command 560 is sent, the firmware canpoll the status area 521 (e.g., a status field 525) for completion orerror responses.

If a mailbox command 560, issued to mailbox 540, includes an operationcode set to write, the persistent SCU 320 determines if it is able toaccept the transfer of the data record or records 550. If the persistentSCU 320 is not able to accept the transfer (or if any errors aredetected with this mailbox command 560), a completion status istransferred back to the system 310. If the persistent SCU 320 is able toaccept the transfer, the persistent SCU 320 issues memory read requests565 for the data.

In an embodiment, the persistent SCU 320 processes the mailbox command560 and issues a read request 565 over PCIe (using the PCIe addressesprovided in the mailbox command 560) to fetch the data including thedata record 513 and the suffix 515. In response to the read request 565,the host bridge of the system 310 performs address translation andprotection checks on the PCIe addresses used for the transfers.

Further, the system 310 responds with memory read responses 570 to theserequests. That is, read responses 570 are provided by the system 310over the synchronous I/O link 305 to the persistent SCU 320 such thatthe data record 550 can be written. Each data record may require eitherone or two PCIe addresses for the transfer as specified in the mailboxcommand 560. For example, if the length fields in the mailbox commandindicate the entire record can be transferred using a single contiguousPCIe address range, only one starting PCIe address is required for eachrecord, with each successive PCIe memory read request using contiguousPCIe addresses. At the end of each data record, the additional 8-byteswill be transferred consisting of the 32-bit CRC that has beenaccumulated for all the data in the record and optionally an LRC orother protection data that has also been accumulated. The total numberof bytes requested for each record can be 8-bytes greater than thelength of the record to include the CRC protection bytes and theadditional 4-bytes for a longitudinal redundancy check (LRC).

After the data and CRC/LRC protection bytes have been successfullyreceived, the persistent SCU 320 responds by issuing a memory write 572(e.g., of 8-bytes of data). The persistent SCU 320 considers thismailbox command 560 complete after initiating this status transfer andmust be ready to accept a new command in this mailbox 540. The system310 will consider the mailbox command 560 complete when the statustransfer has been received. For example, the firmware performs a checkof the status area 521 (e.g., determines if the DTE 523 indicates ‘done’or ‘error’). The system 310 will also consider the mailbox command 560complete if an error is detected during the data transfer, error statusis received from the persistent SCU 320, or the status is not receivedwithin the timeout period for this operation.

FIG. 6 illustrates a block diagram of an environment 600 performingsynchronous I/O with respect to a status operation in accordance with anembodiment. The status operation, for example, can be a completionstatus. As shown in FIG. 6, the environment 600 includes a system 310and a persistent SCU 320. The system 310 includes a logical partition611 and a status area 621 comprising one or more status fields 625. Thepersistent SCU 320 includes at least one mailbox 640.

In response to a status request 670, the completion status (as detectedand recorded by the persistent SCU 320) is transferred 672 by thepersistent SCU 310 to the system 310. In an embodiment where PCIe isutilized that status is transferred 672 to a 64-bit PCIe address that isoffset from a status base address specified during an initializationexchange sequence. The status offset can be calculated as indicated inEquation 1.Offset=(Node#*4096)+(Mailbox#*1024)+(Sequence#*256)  Equation 1

In embodiments, when the persistent SCU 320 completes a read operationsuccessfully, no status is written after the data transfer. Thesuccessful reception of the receive data with valid CRC is sufficientindication that the operation has completed successfully. Inembodiments, when the persistent SCU 320 completes a write operation,the status is written 672 after the write data has been successfullyreceived. In embodiments, when the persistent SCU 320 completes acommand other than a read operation or a write operation, or it isunable to complete a read or write operation successfully, it transfers672 status information to the server.

In embodiments, the status information can include a bit(s) thatindicates whether the status information is valid or invalid. The system310 can poll on this bit(s) looking for it to indicate valid statusinformation so that it knows that status information has been written.

The status information can also include an error status indicator thatindicates whether the mailbox command completed successfully (e.g.,write data was successfully written to the persistent SCU 320) or not(e.g., write of data was not successfully written to the persistent SCU320). In the event that the mailbox command was not successfullycompleted, the status information provides additional details about theerror that was encountered by the persistent SCU 320.

In the event of an error, the status information can include a recordnumber that specifies the record (if any) to which the statusinformation pertains.

In the event of an error, the status information can include a controlunit response code that indicates the reason for the synchronous I/Ooperation failure. Response codes can include, but are not limited toindications that: device-dependent data (e.g., invalid track) is notvalid, see response code qualifier for details; incorrect length (e.g.,length of data does not match record length); SCU device addressinvalid; device-dependent error status presented (e.g., data record notavailable); logical path not established; persistent SCU synchronous I/Obusy; read data not immediately available on persistent SCU; write dataspace not immediately available on persistent SCU; persistent SCU inrecovery; checksum error; invalid operation code; sequence number doesnot match (e.g., mailbox command dropped); link token does not match(e.g., link re-initialized); WWNN does not match (e.g., link connectionchanged); and/or invalid length.

In the event of an error, the status information can include apersistent SCU response code qualifier whose value may include either anarchitected value or a model or device dependent value that furtherdescribes the condition specified by the response code.

In the event of an error, the status information can include a WWNN ofthe persistent SCU returning the status information.

In the event of an error, the status information can include a controlunit timestamp that indicates when the status condition was detected.

In the event of an error, the status information can include adiagnostic information identifier that indicates that diagnosticinformation is available in the persistent SCU and can be uniquelyidentified by the value in this field.

In the event of an error, the status information can include devicespecific status.

Turning now to FIG. 7, a process flow 700 of a synchronous system withrespect to diagnostic controls of synchronous I/O commands is providedin accordance with an embodiment. The process flow 700 being at block705, where an operating system of the synchronous system issues asynchronous I/O command. The synchronous I/O command can also be issuedto any recipient, such as firmware, hypervisor, etc. In particular theoperating system specifies an operation code in the synchronous I/Ocommand to gather diagnostic information. The recipient then builds amailbox command with the diagnostic operation code, so that the storagecontrol unit can gather information such as dumps and traces to allowfor problem debug. In addition, if the firmware detects an errorcondition it may issue the mailbox command with the diagnostic operationcode independently without the operating systems involvement.

At block 710, the operating system gathers information with respect tothe problem that has been detected for which diagnostic information isto be gathered. This information includes, for example dumps and tracesof the operating system, and also a timestamp and a reason code thatindicates the type of problem encountered. The timestamp and reason codeare included as parameters in the synchronous I/O command request block.This information is also relayed to the storage control unit in themailbox command. Once the mailbox command has been received by thestorage control unit, it gathers diagnostic information relative to thesynchronous I/O diagnostic request. This includes dump and traceinformation internal to the storage control unit and also a timestamp atwhich the diagnostic information was gathered. At block 715, the storagecontrol unit also prepares a diagnostic information identifier that isreturned in the status response and can be used to correlate controlunit diagnostic information with the operating system diagnosticinformation.

Turning now to FIG. 8, a process flow 800 of a synchronous system withrespect to diagnostic controls of synchronous I/O commands is providedin accordance with an embodiment. The process flow 800 being at block805, where the synchronous system identifies an abnormal condition. Afirmware and/or an operating system of the synchronous system canidentify the abnormal condition. The abnormal condition can be an I/Oerror, protocol violation, invalid responses, unrecognized commands,time-out conditions, etc. The firmware and/or the operating system ofthe synchronous system can issue synchronous I/O commands to notifyother components of the synchronous system that an abnormal conditionwas identified. For example, if the operating system identifies anabnormal condition, the operating system can issue a synchronous I/Ocommand with a diagnostic operation code to notify the firmware and thestorage control unit of the condition.

At block 810, in response to identifying an abnormal condition, theoperating system can initiate the process of recording stateinformation. In addition to recording local state information within theoperating system, the operating system can issue a synchronous I/Odiagnostic command to the firmware that is then transferred to thestorage control unit in a mailbox command. This causes additional stateinformation to be recorded in the storage control unit, which can beidentified by a diagnostic identifier and a timestamp. This diagnosticidentifier and timestamp is written by the storage control unit to astatus area 421. At block 815 the status area 421 is polled by firmwareand then returned to the operating system in the synchronous I/O commandresponse block. At block 820, the synchronous system correlates thediagnostic information between the operating system, the firmware, andthe storage control unit.

Referring now to FIG. 9, there is shown an embodiment of a processingsystem 900 for implementing the teachings herein. In this embodiment,the processing system 900 has one or more central processing units(processors) 901 a, 901 b, 901 c, etc. (collectively or genericallyreferred to as processor(s) 901). The processors 901, also referred toas processing circuits, are coupled via a system bus 902 to systemmemory 903 and various other components. The system memory 903 caninclude read only memory (ROM) 904 and random access memory (RAM) 905.The ROM 904 is coupled to system bus 902 and may include a basicinput/output system (BIOS), which controls certain basic functions ofthe processing system 900. RAM is read-write memory coupled to systembus 902 for use by processors 901.

FIG. 9 further depicts an input/output (I/O) adapter 906 and a networkadapter 907 coupled to the system bus 902. I/O adapter 906 may be asmall computer system interface (SCSI) adapter that communicates with ahard disk 908 and/or tape storage drive 909 or any other similarcomponent. I/O adapter 906, hard disk 908, and tape storage drive 909are collectively referred to herein as mass storage 910. Software 911for execution on processing system 900 may be stored in mass storage910. The mass storage 910 is an example of a tangible storage mediumreadable by the processors 901, where the software 911 is stored asinstructions for execution by the processors 901 to perform a method,such as the process flows above. Network adapter 907 interconnectssystem bus 902 with an outside network 912 enabling processing system900 to communicate with other such systems. A screen (e.g., a displaymonitor) 915 is connected to system bus 902 by display adapter 916,which may include a graphics controller to improve the performance ofgraphics intensive applications and a video controller. In oneembodiment, adapters 906, 907, and 916 may be connected to one or moreI/O buses that are connected to system bus 902 via an intermediate busbridge (not shown). Suitable I/O buses for connecting peripheral devicessuch as hard disk controllers, network adapters, and graphics adapterstypically include common protocols, such as the Peripheral ComponentInterconnect (PCI). Additional input/output devices are shown asconnected to system bus 902 via an interface adapter 920 and the displayadapter 916. A keyboard 921, mouse 922, and speaker 923 can beinterconnected to system bus 902 via interface adapter 920, which mayinclude, for example, a Super I/O chip integrating multiple deviceadapters into a single integrated circuit.

Thus, as configured in FIG. 9, processing system 905 includes processingcapability in the form of processors 901, and, storage capabilityincluding system memory 903 and mass storage 910, input means such askeyboard 921 and mouse 922, and output capability including speaker 923and display 915. In one embodiment, a portion of system memory 903 andmass storage 910 collectively store an OS, such as the z/OS or AIX OSfrom IBM Corporation, to coordinate the functions of the variouscomponents shown in FIG. 9.

Technical effects and benefits of the embodiments herein provideadvantages over asynchronous/traditional I/O commands by avoidingoverhead of interrupt processing and dispatching.

For instance, asynchronous/traditional I/O commands include thedisadvantage that while waiting on an I/O operation to complete, aprocessor executes other productive work, causing overhead forun-dispatch and re-dispatch, context switch overhead with the I/Ointerrupt and the processor cache content change. In contrast,embodiments herein allows multiple synchronous I/O commands to beinitiated, thus allowing multiple synchronous I/O operations to begin,while also allowing additional work to be performed before resuming thecommand to determine when the I/O operation completes. Further,synchronous I/O commands allow an operating system to issue multiplesynchronous I/O commands to multiple targets or transfer multiplerecords to the same or different targets, to achieve parallelism, andthus improved performance over multiple operations.

In another example, traditional enterprise storage attachments, such asFiber Connection (FICON) and Fibre Channel Protocol (FCP), have multipleprotocol layers that require several hardware, firmware and softwarelevels of processing which cause overhead and add latency. In contrast,the synchronous I/O of embodiments herein eliminates many of theselayers, thus improving system efficiency while providing the enterprisequalities of service that includes end-to-end data integrity checking,in-band instrumentation and measurements, work load management andcontinuous availability with predictable and repeatable highperformance.

Embodiments described herein provide SAN attached external persistentstorage for synchronous access. In addition, embodiments provide thedynamic switching between synchronous I/O and asynchronous I/O access.Shareable external SAN storage typically will have a mix of short andlong running I/O operations which can utilized and benefit from thisability to dynamically switch between the synchronous and asynchronousselection. Embodiments also provide a means for notifying software whenthe data is not available for synchronous access and the dynamicswitching to asynchronous access.

Embodiments described herein provide a low-latency protocol for serverto SAN storage communication that allows synchronous I/O access with itsinherent advantages of avoiding context switches, interruptions andprocessor cache pollution, while also providing mechanisms for avoidingprocessor blocking when access times become too great. Mechanisms aredescribed for dynamic notification and selection of synchronous orasynchronous I/O access.

Thus, embodiments described herein are necessarily rooted in processingsystem to perform proactive operations for efficiently replicating dataacross multiple storage subsystems in order to provide continuousavailability to overcome problems specifically arising in the realm oftraditional I/O and storage subsystem failures.

The embodiments herein may be a system, a method, and/or a computerprogram product at any possible technical detail level of integration.The computer program product may include a computer readable storagemedium (or media) having computer readable program instructions thereonfor causing a processor to carry out aspects of the embodiments herein.

The computer readable storage medium can be a tangible device that canretain and store instructions for use by an instruction executiondevice. The computer readable storage medium may be, for example, but isnot limited to, an electronic storage device, a magnetic storage device,an optical storage device, an electromagnetic storage device, asemiconductor storage device, or any suitable combination of theforegoing. A non-exhaustive list of more specific examples of thecomputer readable storage medium includes the following: a portablecomputer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a static random access memory (SRAM), a portablecompact disc read-only memory (CD-ROM), a digital versatile disk (DVD),a memory stick, a floppy disk, a mechanically encoded device such aspunch-cards or raised structures in a groove having instructionsrecorded thereon, and any suitable combination of the foregoing. Acomputer readable storage medium, as used herein, is not to be construedas being transitory signals per se, such as radio waves or other freelypropagating electromagnetic waves, electromagnetic waves propagatingthrough a waveguide or other transmission media (e.g., light pulsespassing through a fiber-optic cable), or electrical signals transmittedthrough a wire.

Computer readable program instructions described herein can bedownloaded to respective computing/processing devices from a computerreadable storage medium or to an external computer or external storagedevice via a network, for example, the Internet, a local area network, awide area network and/or a wireless network. The network may comprisecopper transmission cables, optical transmission fibers, wirelesstransmission, routers, firewalls, switches, gateway computers and/oredge servers. A network adapter card or network interface in eachcomputing/processing device receives computer readable programinstructions from the network and forwards the computer readable programinstructions for storage in a computer readable storage medium withinthe respective computing/processing device.

Computer readable program instructions for carrying out operations ofthe embodiments herein may be assembler instructions,instruction-set-architecture (ISA) instructions, machine instructions,machine dependent instructions, microcode, firmware instructions,state-setting data, configuration data for integrated circuitry, oreither source code or object code written in any combination of one ormore programming languages, including an object oriented programminglanguage such as Smalltalk, C++, or the like, and procedural programminglanguages, such as the “C” programming language or similar programminglanguages. The computer readable program instructions may executeentirely on the user's computer, partly on the user's computer, as astand-alone software package, partly on the user's computer and partlyon a remote computer or entirely on the remote computer or server. Inthe latter scenario, the remote computer may be connected to the user'scomputer through any type of network, including a local area network(LAN) or a wide area network (WAN), or the connection may be made to anexternal computer (for example, through the Internet using an InternetService Provider). In some embodiments, electronic circuitry including,for example, programmable logic circuitry, field-programmable gatearrays (FPGA), or programmable logic arrays (PLA) may execute thecomputer readable program instructions by utilizing state information ofthe computer readable program instructions to personalize the electroniccircuitry, in order to perform aspects of the embodiments herein.

Aspects of the embodiments herein are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems), and computer program products according to embodiments of theinvention. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer readable program instructions.

These computer readable program instructions may be provided to aprocessor of a general purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. These computer readable program instructionsmay also be stored in a computer readable storage medium that can directa computer, a programmable data processing apparatus, and/or otherdevices to function in a particular manner, such that the computerreadable storage medium having instructions stored therein comprises anarticle of manufacture including instructions which implement aspects ofthe function/act specified in the flowchart and/or block diagram blockor blocks.

The computer readable program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other deviceto cause a series of operational steps to be performed on the computer,other programmable apparatus or other device to produce a computerimplemented process, such that the instructions which execute on thecomputer, other programmable apparatus, or other device implement thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The flowchart and block diagrams in the Figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the embodiments herein. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof instructions, which comprises one or more executable instructions forimplementing the specified logical function(s). In some alternativeimplementations, the functions noted in the blocks may occur out of theorder noted in the Figures. For example, two blocks shown in successionmay, in fact, be executed substantially concurrently, or the blocks maysometimes be executed in the reverse order, depending upon thefunctionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts or carry out combinations of special purpose hardwareand computer instructions.

The terminology used herein is for the purpose of describing particularembodiments only and is not intended to be limiting of the invention. Asused herein, the singular forms “a”, “an” and “the” are intended toinclude the plural forms as well, unless the context clearly indicatesotherwise. It will be further understood that the terms “comprises”and/or “comprising,” when used in this specification, specify thepresence of stated features, integers, steps, operations, elements,and/or components, but do not preclude the presence or addition of onemore other features, integers, steps, operations, element components,and/or groups thereof.

The descriptions of the various embodiments of the embodiments hereinhave been presented for purposes of illustration, but are not intendedto be exhaustive or limited to the embodiments disclosed. Manymodifications and variations will be apparent to those of ordinary skillin the art without departing from the scope and spirit of the describedembodiments. The terminology used herein was chosen to best explain theprinciples of the embodiments, the practical application or technicalimprovement over technologies found in the marketplace, or to enableothers of ordinary skill in the art to understand the embodimentsdisclosed herein.

What is claimed is:
 1. A computer program product, the computer programproduct comprising a computer readable storage medium having programinstructions for communicating synchronous input/output (I/O) commandsbetween a system and a recipient embodied therewith, the systemexecuting on a processor coupled to a memory, the program instructionsexecutable by the processor to cause the processor to perform: issuing,to the recipient, a synchronous I/O command with a diagnostic operationcode, the synchronous I/O command causing a gathering of diagnosticinformation by the recipient; receiving, from the recipient, statusinformation including a diagnostic information identifier, the statusinformation being in accordance with the diagnostic information; andidentifying an abnormal condition; wherein the system is a firmware, thefirmware identifying the abnormal condition, and wherein the issuing ofthe synchronous I/O command is performed by the firmware independentlyof an operating system in response to the identifying the abnormalcondition.
 2. The computer program product of claim 1, wherein therecipient is a storage control unit.
 3. The computer program product ofclaim 1, wherein the system is an operating system, the operating systemidentifying the abnormal condition.
 4. The computer program product ofclaim 1, wherein the abnormal condition is an I/O error, a protocolviolation, an invalid response, an unrecognized command, or a time-outcondition.
 5. The computer program product of claim 1, wherein theissuing of the synchronous I/O command notifies the recipient of anabnormal condition.
 6. The computer program product of claim 1, whereinthe gathering of the diagnostic information includes polling a statusarea by the recipient.
 7. The computer program product of claim 1,wherein the program instructions are further executable by the processorto cause the processor to perform: correlating the diagnosticinformation between the system and the recipient in accordance with thediagnostic information identifier.
 8. A system, comprising a processorand a memory storing program instructions for communicating synchronousinput/output (I/O) commands between a system and a recipient thereon,the program instructions executable by the processor to cause the systemto perform: issuing, to the recipient, a synchronous I/O command with adiagnostic operation code, the synchronous I/O command causing agathering of diagnostic information by the recipient; receiving, fromthe recipient, status information including a diagnostic informationidentifier, the status information being in accordance with thediagnostic information; identifying an abnormal condition; wherein thesystem is a firmware, the firmware identifying the abnormal condition,and wherein the issuing of the synchronous I/O command is performed bythe firmware independently of an operating system in response to theidentifying the abnormal condition.
 9. The system of claim 8, whereinthe recipient is a storage control unit.
 10. The system of claim 8,wherein the system is an operating system, the operating systemidentifying the abnormal condition.
 11. The system of claim 8, whereinthe abnormal condition is an I/O error, a protocol violation, an invalidresponse, an unrecognized command, or a time-out condition.
 12. Thesystem of claim 8, wherein the issuing of the synchronous I/O commandnotifies the recipient of an abnormal condition.
 13. The system of claim8, wherein the gathering of the diagnostic information includes pollinga status area by the recipient.
 14. The system of claim 8, wherein theprogram instructions are further executable by the processor to causethe system to perform: correlating the diagnostic information betweenthe system and the recipient in accordance with the diagnosticinformation identifier.